Processor-In-Memory System Simulator
نویسنده
چکیده
The Problem: The extent of previous work on processor in memory systems using merged DRAM-logic process technology [1, 2, 3, 4] indicates that the technology has great promise. Most of the results to date, however, have been architectural studies with simulation results or proposed architectures. Relatively few designs make it to final silicon due to the prohibitively high Non-Recurring Engineering (NRE) costs of this new process technology. Of the designs that do make it to silicon, many have found that a merged DRAM-logic process is fickle: subtle noise or leakage problems can lead to a non-working DRAM array. This difficulty in realizing PIM architectures in silicon has left the research community with a lack of real platforms for compiler and application development, and thus there is much uncertainty about our ability to efficiently utilize these typically fine-grained parallel processor systems.
منابع مشابه
Earth Simulator Running
The Earth Simulator (ES) is a distributed memory parallel system which consists of 640 processor nodes connected by a single stage full crossbar network. Each processor node is a shared memory system composed of eight vector processors and a memory system of 16 GB. The total peak performance and main memory capacity are 40 Tflops and 10 TB, respectively. Its development was initiated in 1997 an...
متن کاملThe Earth Simulator System
1-1 1-2 1-3 & 2-1 2-2 2-3 & 3-1 The Earth Simulator System By Shinichi HABATA,* Mitsuo YOKOKAWA† and Shigemune KITAWAKI‡ *Computers Division †National Institute of Advanced Industrial Science and Technology (AIST) ‡Earth Simulator Center, Japan Marine Science and Technology Center The Earth Simulator, developed by the Japanese government’s initiative “Earth Simulator Project,” is a highly paral...
متن کاملAn Algorithm for Register-Synchronized Precomputation In Intelligent Memory Systems
This paper presents a novel compiler algorithm for selecting program slices that prefetch load values concurrently with program execution. The algorithm is evaluated in the context of an intelligent memory system. The architecture consists of a main processor and a simple memory processor. The intelligent memory system pre-executes program slices and forwards values of critical loads to the mai...
متن کاملModelling Time in Full System Simulation
Simulation of computer systems is becoming increasingly important in areas such as code optimization, performance evaluation and processor and memory system design. An important trade-off in simulator design is performance vs. abstraction of the model. Simics is a full system simulator, capable of simulating an entire computer system, including the processor, TLB, MMU, memory hierarchy and surr...
متن کاملMidas: An FPGA-based Architecture Simulator for Multiprocessors
We present Midas, an economical FPGA-based architecture simulator that allows rapid early design-space exploration of manycore systems. Midas models target-system timing and functionality separately, and it employs hostmultithreading for an efficient FPGA implementation. It is a high-throughput, cycle-accurate full-system simulator, capable of booting real operating systems. The Midas prototype...
متن کاملA High Performance Parallel IP Lookup Technique Using Distributed Memory Organization and ISCB-Tree Data Structure
The IP Lookup Process is a key bottleneck in routing due to the increase in routing table size, increasing traıc and migration to IPv6 addresses. The IP address lookup involves computation of the Longest Prefix Matching (LPM), which existing solutions such as BSD Radix Tries, scale poorly when traıc in the router increases or when employed for IPv6 address lookups. In this paper, we describe a ...
متن کامل